Towards a Better Exploitation of the Brown 'Family' Corpora in Diachronic Studies of British and American English Language Varieties
نویسنده
چکیده
Since the 1990s, the Brown ‘family’ corpora have been widely used for various diachronic studies of 20th century English language. However, the existing methodologies failed to exploit its full potential as they only used the four main text categories. In this paper, we present the results of two experiments on diachronic changes of the Coleman-Liau readability Index (CLI) in British and American English in the period 1961–1991/2. The first experiment used all fifteen fine-grained text genres, while the second only used the four main text categories. The comparison of the results of these two experiments demonstrated the importance of using all fifteen finegrained text genres for obtaining a better understanding of how language changes.
منابع مشابه
Diachronic Stylistic Changes in British and American Varieties of 20th Century Written English Language
In this paper we present the results of a study investigating the diachronic changes of four stylistic features: average sentence length, Automated Readability Index, lexical density and lexical richness in 20th century written English language. All experiments were conducted on the largest existing diachronic corpora of British and American English – the Brown ‘family’ corpora, employing NLP t...
متن کاملUsing Comparable Corpora to Track Diachronic and Synchronic Changes in Lexical Density and Lexical Richness
This study from the area of language variation and change is based on exploitation of the comparable diachronic and synchronic corpora of 20th century British and American English language (the ‘Brown family’ of corpora). We investigate recent changes of lexical density and lexical richness in two consecutive thirty-year time gaps in British English (1931–1961 and 1961–1991) and in 1961–1992 in...
متن کاملExploring Male and Female Iranian EFL Learners’ Attitude towards Native and Non-native Varieties of English
This study investigated whether Iranian EFL learners are aware of different varieties of English spoken throughout the world and whether they have tendency towards a particular variety of English. Likewise, it explored the attitudes of Iranian EFL learners towards the native and non-native varieties of English. Moreover, it made an attempt to investigate whether such attitudes are gender-orient...
متن کاملDiachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach
A syntactically complex text may represent a problem for both comprehension by humans and various NLP tasks. A large number of studies in text simplification are concerned with this problem and their aim is to transform the given text into a simplified form in order to make it accessible to the wider audience. In this study, we were investigating what the natural tendency of texts is in 20th ce...
متن کاملQuantitative approaches to diachronic corpus linguistics
English Historical Linguistics has a rich and long-standing tradition of corpus-based work (cf. the surveys in Rissanen 2008, Kytö 2012). Resources such as the HELSINKI corpus, the BROWN family of corpora, and ARCHER have spawned active research programs for the study of lexical and grammatical change, both long-term (Curzan 2008) and short-term (Mair 2008). In addition, corpus resources inform...
متن کامل